Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 266824 |
| Missing cells | 1226294 |
| Missing cells (%) | 20.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 46.8 MiB |
| Average record size in memory | 184.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 15 |
sueldo_smdlv is highly correlated with otros_ingresos_smdlv and 1 other fields | High correlation |
otros_ingresos_smdlv is highly correlated with sueldo_smdlv | High correlation |
año_credito is highly correlated with sueldo_smdlv | High correlation |
sueldo_smdlv is highly correlated with otros_ingresos_smdlv and 1 other fields | High correlation |
otros_ingresos_smdlv is highly correlated with sueldo_smdlv | High correlation |
año_credito is highly correlated with sueldo_smdlv | High correlation |
municipio_residencia is highly correlated with municipio_credito and 2 other fields | High correlation |
periodo_credito is highly correlated with estado_final and 2 other fields | High correlation |
estado_final is highly correlated with periodo_credito and 1 other fields | High correlation |
sueldo_smdlv is highly correlated with otros_ingresos_smdlv | High correlation |
municipio_credito is highly correlated with municipio_residencia and 2 other fields | High correlation |
municipio_expedicion is highly correlated with municipio_residencia and 2 other fields | High correlation |
cuotas is highly correlated with forma_pago | High correlation |
genero is highly correlated with Row | High correlation |
forma_pago is highly correlated with estado_final and 3 other fields | High correlation |
año_credito is highly correlated with periodo_credito and 2 other fields | High correlation |
Row is highly correlated with periodo_credito and 4 other fields | High correlation |
otros_ingresos_smdlv is highly correlated with sueldo_smdlv | High correlation |
municipio_nacimiento is highly correlated with municipio_residencia and 3 other fields | High correlation |
municipio_expedicion is highly correlated with municipio_nacimiento | High correlation |
tipo_persona is highly correlated with genero | High correlation |
municipio_credito is highly correlated with municipio_residencia | High correlation |
municipio_residencia is highly correlated with municipio_credito | High correlation |
genero is highly correlated with tipo_persona | High correlation |
forma_pago is highly correlated with estado_final | High correlation |
estado_final is highly correlated with forma_pago | High correlation |
municipio_nacimiento is highly correlated with municipio_expedicion | High correlation |
genero has 151596 (56.8%) missing values | Missing |
estado_civil has 157960 (59.2%) missing values | Missing |
edad has 168709 (63.2%) missing values | Missing |
municipio_nacimiento has 11704 (4.4%) missing values | Missing |
municipio_expedicion has 142399 (53.4%) missing values | Missing |
tiene_casa_propia has 158961 (59.6%) missing values | Missing |
sueldo_smdlv has 174837 (65.5%) missing values | Missing |
otros_ingresos_smdlv has 260110 (97.5%) missing values | Missing |
Row is uniformly distributed | Uniform |
Row has unique values | Unique |
Reproduction
| Analysis started | 2021-05-12 19:52:36.338663 |
|---|---|
| Analysis finished | 2021-05-12 19:53:51.288029 |
| Duration | 1 minute and 14.95 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 266824 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 133412.5 |
| Minimum | 1 |
|---|---|
| Maximum | 266824 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 13342.15 |
| Q1 | 66706.75 |
| median | 133412.5 |
| Q3 | 200118.25 |
| 95-th percentile | 253482.85 |
| Maximum | 266824 |
| Range | 266823 |
| Interquartile range (IQR) | 133411.5 |
Descriptive statistics
| Standard deviation | 77025.59845 |
|---|---|
| Coefficient of variation (CV) | 0.5773491873 |
| Kurtosis | -1.2 |
| Mean | 133412.5 |
| Median Absolute Deviation (MAD) | 66706 |
| Skewness | -1.074941117 × 10-15 |
| Sum | 3.55976569 × 1010 |
| Variance | 5932942817 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 2047 | 1 | < 0.1% |
| 92792 | 1 | < 0.1% |
| 72310 | 1 | < 0.1% |
| 66165 | 1 | < 0.1% |
| 68212 | 1 | < 0.1% |
| 78451 | 1 | < 0.1% |
| 80498 | 1 | < 0.1% |
| 74353 | 1 | < 0.1% |
| 76400 | 1 | < 0.1% |
| 119407 | 1 | < 0.1% |
| Other values (266814) | 266814 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 266824 | 1 | |
| 266823 | 1 | |
| 266822 | 1 | |
| 266821 | 1 | |
| 266820 | 1 | |
| 266819 | 1 | |
| 266818 | 1 | |
| 266817 | 1 | |
| 266816 | 1 | |
| 266815 | 1 |
procedencia
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| Nacional | |
|---|---|
| Extranjero | 723 |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 8.005419303 |
| Min length | 8 |
Characters and Unicode
| Total characters | 2136038 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Nacional |
|---|---|
| 2nd row | Nacional |
| 3rd row | Nacional |
| 4th row | Nacional |
| 5th row | Nacional |
Common Values
| Value | Count | Frequency (%) |
| Nacional | 266101 | |
| Extranjero | 723 | 0.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| nacional | 266101 | |
| extranjero | 723 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 532925 | |
| o | 266824 | |
| n | 266824 | |
| N | 266101 | |
| c | 266101 | |
| i | 266101 | |
| l | 266101 | |
| r | 1446 | 0.1% |
| E | 723 | < 0.1% |
| x | 723 | < 0.1% |
| Other values (3) | 2169 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1869214 | |
| Uppercase Letter | 266824 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 532925 | |
| o | 266824 | |
| n | 266824 | |
| c | 266101 | |
| i | 266101 | |
| l | 266101 | |
| r | 1446 | 0.1% |
| x | 723 | < 0.1% |
| t | 723 | < 0.1% |
| j | 723 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 266101 | |
| E | 723 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2136038 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 532925 | |
| o | 266824 | |
| n | 266824 | |
| N | 266101 | |
| c | 266101 | |
| i | 266101 | |
| l | 266101 | |
| r | 1446 | 0.1% |
| E | 723 | < 0.1% |
| x | 723 | < 0.1% |
| Other values (3) | 2169 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2136038 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 532925 | |
| o | 266824 | |
| n | 266824 | |
| N | 266101 | |
| c | 266101 | |
| i | 266101 | |
| l | 266101 | |
| r | 1446 | 0.1% |
| E | 723 | < 0.1% |
| x | 723 | < 0.1% |
| Other values (3) | 2169 | 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 151596 |
| Missing (%) | 56.8% |
| Memory size | 2.0 MiB |
| Femenino | |
|---|---|
| Masculino |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.464522512 |
| Min length | 8 |
Characters and Unicode
| Total characters | 975350 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Femenino |
|---|---|
| 2nd row | Femenino |
| 3rd row | Femenino |
| 4th row | Femenino |
| 5th row | Femenino |
Common Values
| Value | Count | Frequency (%) |
| Femenino | 61702 | |
| Masculino | 53526 | 20.1% |
| (Missing) | 151596 |
Length
Pie chart
| Value | Count | Frequency (%) |
| femenino | 61702 | |
| masculino | 53526 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 176930 | |
| e | 123404 | |
| i | 115228 | |
| o | 115228 | |
| F | 61702 | 6.3% |
| m | 61702 | 6.3% |
| M | 53526 | 5.5% |
| a | 53526 | 5.5% |
| s | 53526 | 5.5% |
| c | 53526 | 5.5% |
| Other values (2) | 107052 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 860122 | |
| Uppercase Letter | 115228 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 176930 | |
| e | 123404 | |
| i | 115228 | |
| o | 115228 | |
| m | 61702 | 7.2% |
| a | 53526 | 6.2% |
| s | 53526 | 6.2% |
| c | 53526 | 6.2% |
| u | 53526 | 6.2% |
| l | 53526 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 61702 | |
| M | 53526 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 975350 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 176930 | |
| e | 123404 | |
| i | 115228 | |
| o | 115228 | |
| F | 61702 | 6.3% |
| m | 61702 | 6.3% |
| M | 53526 | 5.5% |
| a | 53526 | 5.5% |
| s | 53526 | 5.5% |
| c | 53526 | 5.5% |
| Other values (2) | 107052 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 975350 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 176930 | |
| e | 123404 | |
| i | 115228 | |
| o | 115228 | |
| F | 61702 | 6.3% |
| m | 61702 | 6.3% |
| M | 53526 | 5.5% |
| a | 53526 | 5.5% |
| s | 53526 | 5.5% |
| c | 53526 | 5.5% |
| Other values (2) | 107052 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 157960 |
| Missing (%) | 59.2% |
| Memory size | 2.0 MiB |
| Union Libre | |
|---|---|
| Soltero | |
| Casado | |
| Viudo | 1006 |
| Divorciado | 580 |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 8.217335391 |
| Min length | 5 |
Characters and Unicode
| Total characters | 894572 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Union Libre |
|---|---|
| 2nd row | Union Libre |
| 3rd row | Union Libre |
| 4th row | Union Libre |
| 5th row | Soltero |
Common Values
| Value | Count | Frequency (%) |
| Union Libre | 41320 | 15.5% |
| Soltero | 33474 | 12.5% |
| Casado | 32484 | 12.2% |
| Viudo | 1006 | 0.4% |
| Divorciado | 580 | 0.2% |
| (Missing) | 157960 |
Length
Pie chart
| Value | Count | Frequency (%) |
| union | 41320 | |
| libre | 41320 | |
| soltero | 33474 | |
| casado | 32484 | |
| viudo | 1006 | 0.7% |
| divorciado | 580 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 142918 | |
| i | 84806 | |
| n | 82640 | |
| r | 75374 | 8.4% |
| e | 74794 | 8.4% |
| a | 65548 | 7.3% |
| U | 41320 | 4.6% |
| 41320 | 4.6% | |
| L | 41320 | 4.6% |
| b | 41320 | 4.6% |
| Other values (11) | 203212 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 703068 | |
| Uppercase Letter | 150184 | 16.8% |
| Space Separator | 41320 | 4.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 142918 | |
| i | 84806 | |
| n | 82640 | |
| r | 75374 | |
| e | 74794 | |
| a | 65548 | |
| b | 41320 | 5.9% |
| d | 34070 | 4.8% |
| l | 33474 | 4.8% |
| t | 33474 | 4.8% |
| Other values (4) | 34650 | 4.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 41320 | |
| L | 41320 | |
| S | 33474 | |
| C | 32484 | |
| V | 1006 | 0.7% |
| D | 580 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 41320 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 853252 | |
| Common | 41320 | 4.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 142918 | |
| i | 84806 | |
| n | 82640 | |
| r | 75374 | |
| e | 74794 | |
| a | 65548 | 7.7% |
| U | 41320 | 4.8% |
| L | 41320 | 4.8% |
| b | 41320 | 4.8% |
| d | 34070 | 4.0% |
| Other values (10) | 169142 |
Common
| Value | Count | Frequency (%) |
| 41320 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 894572 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 142918 | |
| i | 84806 | |
| n | 82640 | |
| r | 75374 | 8.4% |
| e | 74794 | 8.4% |
| a | 65548 | 7.3% |
| U | 41320 | 4.6% |
| 41320 | 4.6% | |
| L | 41320 | 4.6% |
| b | 41320 | 4.6% |
| Other values (11) | 203212 |
| Distinct | 74 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 168709 |
| Missing (%) | 63.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.13916323 |
| Minimum | 18 |
|---|---|
| Maximum | 98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 35 |
| median | 43 |
| Q3 | 52 |
| 95-th percentile | 64 |
| Maximum | 98 |
| Range | 80 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 11.92553382 |
|---|---|
| Coefficient of variation (CV) | 0.270180333 |
| Kurtosis | -0.3887701935 |
| Mean | 44.13916323 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.3365665568 |
| Sum | 4330714 |
| Variance | 142.2183569 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 44 | 3614 | 1.4% |
| 40 | 3359 | 1.3% |
| 41 | 3123 | 1.2% |
| 45 | 3111 | 1.2% |
| 49 | 2996 | 1.1% |
| 35 | 2994 | 1.1% |
| 46 | 2883 | 1.1% |
| 31 | 2843 | 1.1% |
| 36 | 2759 | 1.0% |
| 37 | 2757 | 1.0% |
| Other values (64) | 67676 | |
| (Missing) | 168709 |
| Value | Count | Frequency (%) |
| 18 | 12 | < 0.1% |
| 19 | 72 | < 0.1% |
| 20 | 222 | 0.1% |
| 21 | 326 | 0.1% |
| 22 | 465 | 0.2% |
| 23 | 653 | 0.2% |
| 24 | 784 | |
| 25 | 1195 | |
| 26 | 1476 | |
| 27 | 1659 |
| Value | Count | Frequency (%) |
| 98 | 2 | < 0.1% |
| 90 | 3 | < 0.1% |
| 89 | 1 | < 0.1% |
| 88 | 2 | < 0.1% |
| 87 | 1 | < 0.1% |
| 86 | 1 | < 0.1% |
| 85 | 6 | < 0.1% |
| 84 | 59 | |
| 83 | 50 | |
| 82 | 27 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| ARAUCA | |
|---|---|
| TAME | |
| SARAVENA | |
| Otros | |
| ARAUQUITA |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 5.841539742 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1558663 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SARAVENA |
|---|---|
| 2nd row | ARAUCA |
| 3rd row | ARAUQUITA |
| 4th row | ARAUCA |
| 5th row | ARAUQUITA |
Common Values
| Value | Count | Frequency (%) |
| ARAUCA | 130507 | |
| TAME | 69756 | |
| SARAVENA | 35432 | 13.3% |
| Otros | 16755 | 6.3% |
| ARAUQUITA | 14374 | 5.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| arauca | 130507 | |
| tame | 69756 | |
| saravena | 35432 | 13.3% |
| otros | 16755 | 6.3% |
| arauquita | 14374 | 5.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 610695 | |
| R | 180313 | 11.6% |
| U | 159255 | 10.2% |
| C | 130507 | 8.4% |
| E | 105188 | 6.7% |
| T | 84130 | 5.4% |
| M | 69756 | 4.5% |
| S | 35432 | 2.3% |
| V | 35432 | 2.3% |
| N | 35432 | 2.3% |
| Other values (7) | 112523 | 7.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1491643 | |
| Lowercase Letter | 67020 | 4.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 610695 | |
| R | 180313 | 12.1% |
| U | 159255 | 10.7% |
| C | 130507 | 8.7% |
| E | 105188 | 7.1% |
| T | 84130 | 5.6% |
| M | 69756 | 4.7% |
| S | 35432 | 2.4% |
| V | 35432 | 2.4% |
| N | 35432 | 2.4% |
| Other values (3) | 45503 | 3.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 16755 | |
| r | 16755 | |
| o | 16755 | |
| s | 16755 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1558663 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 610695 | |
| R | 180313 | 11.6% |
| U | 159255 | 10.2% |
| C | 130507 | 8.4% |
| E | 105188 | 6.7% |
| T | 84130 | 5.4% |
| M | 69756 | 4.5% |
| S | 35432 | 2.3% |
| V | 35432 | 2.3% |
| N | 35432 | 2.3% |
| Other values (7) | 112523 | 7.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1558663 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 610695 | |
| R | 180313 | 11.6% |
| U | 159255 | 10.2% |
| C | 130507 | 8.4% |
| E | 105188 | 6.7% |
| T | 84130 | 5.4% |
| M | 69756 | 4.5% |
| S | 35432 | 2.3% |
| V | 35432 | 2.3% |
| N | 35432 | 2.3% |
| Other values (7) | 112523 | 7.2% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11704 |
| Missing (%) | 4.4% |
| Memory size | 2.0 MiB |
| ARAUCA | |
|---|---|
| Otros | |
| TAME | |
| ARAUQUITA | 10188 |
| SARAVENA | 8012 |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 5.829672311 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1487266 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Otros |
|---|---|
| 2nd row | ARAUQUITA |
| 3rd row | ARAUQUITA |
| 4th row | ARAUQUITA |
| 5th row | ARAUCA |
Common Values
| Value | Count | Frequency (%) |
| ARAUCA | 171467 | |
| Otros | 40864 | 15.3% |
| TAME | 24589 | 9.2% |
| ARAUQUITA | 10188 | 3.8% |
| SARAVENA | 8012 | 3.0% |
| (Missing) | 11704 | 4.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| arauca | 171467 | |
| otros | 40864 | 16.0% |
| tame | 24589 | 9.6% |
| arauquita | 10188 | 4.0% |
| saravena | 8012 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 593590 | |
| U | 191843 | 12.9% |
| R | 189667 | 12.8% |
| C | 171467 | 11.5% |
| O | 40864 | 2.7% |
| t | 40864 | 2.7% |
| r | 40864 | 2.7% |
| o | 40864 | 2.7% |
| s | 40864 | 2.7% |
| T | 34777 | 2.3% |
| Other values (7) | 101602 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1323810 | |
| Lowercase Letter | 163456 | 11.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 593590 | |
| U | 191843 | 14.5% |
| R | 189667 | 14.3% |
| C | 171467 | 13.0% |
| O | 40864 | 3.1% |
| T | 34777 | 2.6% |
| E | 32601 | 2.5% |
| M | 24589 | 1.9% |
| Q | 10188 | 0.8% |
| I | 10188 | 0.8% |
| Other values (3) | 24036 | 1.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 40864 | |
| r | 40864 | |
| o | 40864 | |
| s | 40864 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1487266 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 593590 | |
| U | 191843 | 12.9% |
| R | 189667 | 12.8% |
| C | 171467 | 11.5% |
| O | 40864 | 2.7% |
| t | 40864 | 2.7% |
| r | 40864 | 2.7% |
| o | 40864 | 2.7% |
| s | 40864 | 2.7% |
| T | 34777 | 2.3% |
| Other values (7) | 101602 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1487266 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 593590 | |
| U | 191843 | 12.9% |
| R | 189667 | 12.8% |
| C | 171467 | 11.5% |
| O | 40864 | 2.7% |
| t | 40864 | 2.7% |
| r | 40864 | 2.7% |
| o | 40864 | 2.7% |
| s | 40864 | 2.7% |
| T | 34777 | 2.3% |
| Other values (7) | 101602 | 6.8% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 142399 |
| Missing (%) | 53.4% |
| Memory size | 2.0 MiB |
| ARAUCA | |
|---|---|
| Otros | |
| TAME | |
| ARAUQUITA | |
| SARAVENA | 4209 |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 5.64299779 |
| Min length | 4 |
Characters and Unicode
| Total characters | 702130 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Otros |
|---|---|
| 2nd row | ARAUCA |
| 3rd row | ARAUQUITA |
| 4th row | ARAUQUITA |
| 5th row | ARAUCA |
Common Values
| Value | Count | Frequency (%) |
| ARAUCA | 55057 | 20.6% |
| Otros | 39115 | 14.7% |
| TAME | 18371 | 6.9% |
| ARAUQUITA | 7673 | 2.9% |
| SARAVENA | 4209 | 1.6% |
| (Missing) | 142399 |
Length
Pie chart
| Value | Count | Frequency (%) |
| arauca | 55057 | |
| otros | 39115 | |
| tame | 18371 | 14.8% |
| arauquita | 7673 | 6.2% |
| saravena | 4209 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 219188 | |
| U | 70403 | 10.0% |
| R | 66939 | 9.5% |
| C | 55057 | 7.8% |
| O | 39115 | 5.6% |
| t | 39115 | 5.6% |
| r | 39115 | 5.6% |
| o | 39115 | 5.6% |
| s | 39115 | 5.6% |
| T | 26044 | 3.7% |
| Other values (7) | 68924 | 9.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 545670 | |
| Lowercase Letter | 156460 | 22.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 219188 | |
| U | 70403 | 12.9% |
| R | 66939 | 12.3% |
| C | 55057 | 10.1% |
| O | 39115 | 7.2% |
| T | 26044 | 4.8% |
| E | 22580 | 4.1% |
| M | 18371 | 3.4% |
| Q | 7673 | 1.4% |
| I | 7673 | 1.4% |
| Other values (3) | 12627 | 2.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 39115 | |
| r | 39115 | |
| o | 39115 | |
| s | 39115 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 702130 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 219188 | |
| U | 70403 | 10.0% |
| R | 66939 | 9.5% |
| C | 55057 | 7.8% |
| O | 39115 | 5.6% |
| t | 39115 | 5.6% |
| r | 39115 | 5.6% |
| o | 39115 | 5.6% |
| s | 39115 | 5.6% |
| T | 26044 | 3.7% |
| Other values (7) | 68924 | 9.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 702130 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 219188 | |
| U | 70403 | 10.0% |
| R | 66939 | 9.5% |
| C | 55057 | 7.8% |
| O | 39115 | 5.6% |
| t | 39115 | 5.6% |
| r | 39115 | 5.6% |
| o | 39115 | 5.6% |
| s | 39115 | 5.6% |
| T | 26044 | 3.7% |
| Other values (7) | 68924 | 9.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| Natural | |
|---|---|
| Juridica | 5069 |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.018997541 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1872837 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Natural |
|---|---|
| 2nd row | Natural |
| 3rd row | Natural |
| 4th row | Natural |
| 5th row | Natural |
Common Values
| Value | Count | Frequency (%) |
| Natural | 261755 | |
| Juridica | 5069 | 1.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| natural | 261755 | |
| juridica | 5069 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 528579 | |
| u | 266824 | |
| r | 266824 | |
| N | 261755 | |
| t | 261755 | |
| l | 261755 | |
| i | 10138 | 0.5% |
| J | 5069 | 0.3% |
| d | 5069 | 0.3% |
| c | 5069 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1606013 | |
| Uppercase Letter | 266824 | 14.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 528579 | |
| u | 266824 | |
| r | 266824 | |
| t | 261755 | |
| l | 261755 | |
| i | 10138 | 0.6% |
| d | 5069 | 0.3% |
| c | 5069 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 261755 | |
| J | 5069 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1872837 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 528579 | |
| u | 266824 | |
| r | 266824 | |
| N | 261755 | |
| t | 261755 | |
| l | 261755 | |
| i | 10138 | 0.5% |
| J | 5069 | 0.3% |
| d | 5069 | 0.3% |
| c | 5069 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1872837 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 528579 | |
| u | 266824 | |
| r | 266824 | |
| N | 261755 | |
| t | 261755 | |
| l | 261755 | |
| i | 10138 | 0.5% |
| J | 5069 | 0.3% |
| d | 5069 | 0.3% |
| c | 5069 | 0.3% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 158961 |
| Missing (%) | 59.6% |
| Memory size | 2.0 MiB |
| Si | |
|---|---|
| No |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 215726 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Si |
|---|---|
| 2nd row | No |
| 3rd row | No |
| 4th row | No |
| 5th row | No |
Common Values
| Value | Count | Frequency (%) |
| Si | 78054 | |
| No | 29809 | 11.2% |
| (Missing) | 158961 |
Length
Pie chart
| Value | Count | Frequency (%) |
| si | 78054 | |
| no | 29809 | 27.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 78054 | |
| i | 78054 | |
| N | 29809 | 13.8% |
| o | 29809 | 13.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 107863 | |
| Lowercase Letter | 107863 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 78054 | |
| N | 29809 | 27.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 78054 | |
| o | 29809 | 27.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 215726 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 78054 | |
| i | 78054 | |
| N | 29809 | 13.8% |
| o | 29809 | 13.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 215726 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 78054 | |
| i | 78054 | |
| N | 29809 | 13.8% |
| o | 29809 | 13.8% |
| Distinct | 524 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 174837 |
| Missing (%) | 65.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 103.2626784 |
| Minimum | 3 |
|---|---|
| Maximum | 600 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 27 |
| Q1 | 43 |
| median | 73 |
| Q3 | 130 |
| 95-th percentile | 286 |
| Maximum | 600 |
| Range | 597 |
| Interquartile range (IQR) | 87 |
Descriptive statistics
| Standard deviation | 90.27400954 |
|---|---|
| Coefficient of variation (CV) | 0.8742171995 |
| Kurtosis | 7.070204741 |
| Mean | 103.2626784 |
| Median Absolute Deviation (MAD) | 35 |
| Skewness | 2.324892561 |
| Sum | 9498824 |
| Variance | 8149.396798 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34 | 2384 | 0.9% |
| 36 | 1895 | 0.7% |
| 43 | 1869 | 0.7% |
| 61 | 1650 | 0.6% |
| 72 | 1641 | 0.6% |
| 30 | 1616 | 0.6% |
| 68 | 1587 | 0.6% |
| 46 | 1483 | 0.6% |
| 76 | 1448 | 0.5% |
| 65 | 1415 | 0.5% |
| Other values (514) | 74999 | |
| (Missing) | 174837 |
| Value | Count | Frequency (%) |
| 3 | 13 | < 0.1% |
| 4 | 3 | < 0.1% |
| 5 | 19 | < 0.1% |
| 6 | 31 | < 0.1% |
| 7 | 60 | < 0.1% |
| 8 | 25 | < 0.1% |
| 9 | 37 | < 0.1% |
| 10 | 205 | |
| 11 | 55 | < 0.1% |
| 12 | 47 | < 0.1% |
| Value | Count | Frequency (%) |
| 600 | 388 | |
| 599 | 2 | < 0.1% |
| 597 | 1 | < 0.1% |
| 594 | 1 | < 0.1% |
| 593 | 1 | < 0.1% |
| 588 | 53 | < 0.1% |
| 586 | 6 | < 0.1% |
| 584 | 7 | < 0.1% |
| 582 | 15 | < 0.1% |
| 578 | 1 | < 0.1% |
| Distinct | 222 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 260110 |
| Missing (%) | 97.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55.00417039 |
| Minimum | 0 |
|---|---|
| Maximum | 300 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 19 |
| median | 34 |
| Q3 | 68 |
| 95-th percentile | 182.7 |
| Maximum | 300 |
| Range | 300 |
| Interquartile range (IQR) | 49 |
Descriptive statistics
| Standard deviation | 58.67205411 |
|---|---|
| Coefficient of variation (CV) | 1.066683739 |
| Kurtosis | 5.815075647 |
| Mean | 55.00417039 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | 2.338335015 |
| Sum | 369298 |
| Variance | 3442.409933 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34 | 227 | 0.1% |
| 17 | 196 | 0.1% |
| 18 | 193 | 0.1% |
| 36 | 191 | 0.1% |
| 20 | 157 | 0.1% |
| 23 | 156 | 0.1% |
| 13 | 154 | 0.1% |
| 30 | 143 | 0.1% |
| 21 | 142 | 0.1% |
| 26 | 131 | < 0.1% |
| Other values (212) | 5024 | 1.9% |
| (Missing) | 260110 |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 1 | 9 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 64 | |
| 4 | 45 | < 0.1% |
| 5 | 40 | < 0.1% |
| 6 | 94 | |
| 7 | 119 | |
| 8 | 72 | |
| 9 | 57 |
| Value | Count | Frequency (%) |
| 300 | 109 | |
| 294 | 8 | < 0.1% |
| 291 | 11 | < 0.1% |
| 289 | 1 | < 0.1% |
| 288 | 3 | < 0.1% |
| 280 | 2 | < 0.1% |
| 279 | 2 | < 0.1% |
| 276 | 1 | < 0.1% |
| 271 | 7 | < 0.1% |
| 268 | 3 | < 0.1% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| ARAUCA | |
|---|---|
| TAME | |
| SARAVENA | |
| ARAUQUITA | |
| Otros |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 5.923443918 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1580517 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SARAVENA |
|---|---|
| 2nd row | ARAUCA |
| 3rd row | ARAUQUITA |
| 4th row | Otros |
| 5th row | ARAUQUITA |
Common Values
| Value | Count | Frequency (%) |
| ARAUCA | 127641 | |
| TAME | 63795 | |
| SARAVENA | 29609 | 11.1% |
| ARAUQUITA | 23431 | 8.8% |
| Otros | 22348 | 8.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| arauca | 127641 | |
| tame | 63795 | |
| saravena | 29609 | 11.1% |
| arauquita | 23431 | 8.8% |
| otros | 22348 | 8.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 605838 | |
| R | 180681 | 11.4% |
| U | 174503 | 11.0% |
| C | 127641 | 8.1% |
| E | 93404 | 5.9% |
| T | 87226 | 5.5% |
| M | 63795 | 4.0% |
| S | 29609 | 1.9% |
| V | 29609 | 1.9% |
| N | 29609 | 1.9% |
| Other values (7) | 158602 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1491125 | |
| Lowercase Letter | 89392 | 5.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 605838 | |
| R | 180681 | 12.1% |
| U | 174503 | 11.7% |
| C | 127641 | 8.6% |
| E | 93404 | 6.3% |
| T | 87226 | 5.8% |
| M | 63795 | 4.3% |
| S | 29609 | 2.0% |
| V | 29609 | 2.0% |
| N | 29609 | 2.0% |
| Other values (3) | 69210 | 4.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 22348 | |
| r | 22348 | |
| o | 22348 | |
| s | 22348 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1580517 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 605838 | |
| R | 180681 | 11.4% |
| U | 174503 | 11.0% |
| C | 127641 | 8.1% |
| E | 93404 | 5.9% |
| T | 87226 | 5.5% |
| M | 63795 | 4.0% |
| S | 29609 | 1.9% |
| V | 29609 | 1.9% |
| N | 29609 | 1.9% |
| Other values (7) | 158602 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1580517 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 605838 | |
| R | 180681 | 11.4% |
| U | 174503 | 11.0% |
| C | 127641 | 8.1% |
| E | 93404 | 5.9% |
| T | 87226 | 5.5% |
| M | 63795 | 4.0% |
| S | 29609 | 1.9% |
| V | 29609 | 1.9% |
| N | 29609 | 1.9% |
| Other values (7) | 158602 | 10.0% |
codeudor
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| SIN CODEUDOR | |
|---|---|
| CON CODEUDOR |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Characters and Unicode
| Total characters | 3201888 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SIN CODEUDOR |
|---|---|
| 2nd row | CON CODEUDOR |
| 3rd row | CON CODEUDOR |
| 4th row | CON CODEUDOR |
| 5th row | CON CODEUDOR |
Common Values
| Value | Count | Frequency (%) |
| SIN CODEUDOR | 239311 | |
| CON CODEUDOR | 27513 | 10.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| codeudor | 266824 | |
| sin | 239311 | |
| con | 27513 | 5.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 561161 | |
| D | 533648 | |
| C | 294337 | |
| N | 266824 | |
| 266824 | ||
| E | 266824 | |
| U | 266824 | |
| R | 266824 | |
| S | 239311 | |
| I | 239311 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2935064 | |
| Space Separator | 266824 | 8.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 561161 | |
| D | 533648 | |
| C | 294337 | |
| N | 266824 | |
| E | 266824 | |
| U | 266824 | |
| R | 266824 | |
| S | 239311 | |
| I | 239311 |
Space Separator
| Value | Count | Frequency (%) |
| 266824 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2935064 | |
| Common | 266824 | 8.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 561161 | |
| D | 533648 | |
| C | 294337 | |
| N | 266824 | |
| E | 266824 | |
| U | 266824 | |
| R | 266824 | |
| S | 239311 | |
| I | 239311 |
Common
| Value | Count | Frequency (%) |
| 266824 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3201888 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 561161 | |
| D | 533648 | |
| C | 294337 | |
| N | 266824 | |
| 266824 | ||
| E | 266824 | |
| U | 266824 | |
| R | 266824 | |
| S | 239311 | |
| I | 239311 |
sector
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Memory size | 2.0 MiB |
| PRIVADO | |
|---|---|
| PUBLICO | 22876 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1867642 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRIVADO |
|---|---|
| 2nd row | PRIVADO |
| 3rd row | PRIVADO |
| 4th row | PRIVADO |
| 5th row | PRIVADO |
Common Values
| Value | Count | Frequency (%) |
| PRIVADO | 243930 | |
| PUBLICO | 22876 | 8.6% |
| (Missing) | 18 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| privado | 243930 | |
| publico | 22876 | 8.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 266806 | |
| I | 266806 | |
| O | 266806 | |
| R | 243930 | |
| V | 243930 | |
| A | 243930 | |
| D | 243930 | |
| U | 22876 | 1.2% |
| B | 22876 | 1.2% |
| L | 22876 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1867642 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 266806 | |
| I | 266806 | |
| O | 266806 | |
| R | 243930 | |
| V | 243930 | |
| A | 243930 | |
| D | 243930 | |
| U | 22876 | 1.2% |
| B | 22876 | 1.2% |
| L | 22876 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1867642 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 266806 | |
| I | 266806 | |
| O | 266806 | |
| R | 243930 | |
| V | 243930 | |
| A | 243930 | |
| D | 243930 | |
| U | 22876 | 1.2% |
| B | 22876 | 1.2% |
| L | 22876 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1867642 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 266806 | |
| I | 266806 | |
| O | 266806 | |
| R | 243930 | |
| V | 243930 | |
| A | 243930 | |
| D | 243930 | |
| U | 22876 | 1.2% |
| B | 22876 | 1.2% |
| L | 22876 | 1.2% |
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2013.138233 |
| Minimum | 1993 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 1993 |
|---|---|
| 5-th percentile | 2001 |
| Q1 | 2010 |
| median | 2014 |
| Q3 | 2018 |
| 95-th percentile | 2020 |
| Maximum | 2021 |
| Range | 28 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 5.723918046 |
|---|---|
| Coefficient of variation (CV) | 0.002843281177 |
| Kurtosis | -0.1314163128 |
| Mean | 2013.138233 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.7662398934 |
| Sum | 537153596 |
| Variance | 32.76323779 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2020 | 28739 | 10.8% |
| 2015 | 21814 | 8.2% |
| 2019 | 20497 | 7.7% |
| 2014 | 17957 | 6.7% |
| 2017 | 17407 | 6.5% |
| 2018 | 17236 | 6.5% |
| 2016 | 16822 | 6.3% |
| 2013 | 16573 | 6.2% |
| 2012 | 14917 | 5.6% |
| 2011 | 13474 | 5.0% |
| Other values (17) | 81388 |
| Value | Count | Frequency (%) |
| 1993 | 1 | < 0.1% |
| 1996 | 1 | < 0.1% |
| 1997 | 1144 | 0.4% |
| 1998 | 2317 | |
| 1999 | 3094 | |
| 2000 | 3237 | |
| 2001 | 3676 | |
| 2002 | 3648 | |
| 2003 | 3823 | |
| 2004 | 4843 |
| Value | Count | Frequency (%) |
| 2021 | 5966 | 2.2% |
| 2020 | 28739 | |
| 2019 | 20497 | |
| 2018 | 17236 | |
| 2017 | 17407 | |
| 2016 | 16822 | |
| 2015 | 21814 | |
| 2014 | 17957 | |
| 2013 | 16573 | |
| 2012 | 14917 |
mes_credito
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.880040776 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.498950777 |
|---|---|
| Coefficient of variation (CV) | 0.5085654127 |
| Kurtosis | -1.227123233 |
| Mean | 6.880040776 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.1234824413 |
| Sum | 1835760 |
| Variance | 12.24265654 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 29294 | |
| 10 | 25731 | |
| 11 | 24466 | |
| 9 | 22504 | |
| 7 | 22413 | |
| 6 | 21385 | |
| 5 | 21295 | |
| 8 | 20835 | |
| 3 | 20443 | |
| 2 | 19931 | |
| Other values (2) | 38527 |
| Value | Count | Frequency (%) |
| 1 | 19465 | |
| 2 | 19931 | |
| 3 | 20443 | |
| 4 | 19062 | |
| 5 | 21295 | |
| 6 | 21385 | |
| 7 | 22413 | |
| 8 | 20835 | |
| 9 | 22504 | |
| 10 | 25731 |
| Value | Count | Frequency (%) |
| 12 | 29294 | |
| 11 | 24466 | |
| 10 | 25731 | |
| 9 | 22504 | |
| 8 | 20835 | |
| 7 | 22413 | |
| 6 | 21385 | |
| 5 | 21295 | |
| 4 | 19062 | |
| 3 | 20443 |
valor_credito_smdlv
Real number (ℝ≥0)
| Distinct | 672 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57.19756094 |
| Minimum | 0 |
|---|---|
| Maximum | 700 |
| Zeros | 2290 |
| Zeros (%) | 0.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 15 |
| median | 37 |
| Q3 | 68 |
| 95-th percentile | 200 |
| Maximum | 700 |
| Range | 700 |
| Interquartile range (IQR) | 53 |
Descriptive statistics
| Standard deviation | 73.32605347 |
|---|---|
| Coefficient of variation (CV) | 1.281978677 |
| Kurtosis | 19.22517259 |
| Mean | 57.19756094 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 3.593908011 |
| Sum | 15261682 |
| Variance | 5376.710118 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 8907 | 3.3% |
| 5 | 5947 | 2.2% |
| 7 | 5743 | 2.2% |
| 1 | 5497 | 2.1% |
| 2 | 5289 | 2.0% |
| 15 | 5055 | 1.9% |
| 16 | 4541 | 1.7% |
| 14 | 4367 | 1.6% |
| 8 | 3908 | 1.5% |
| 4 | 3823 | 1.4% |
| Other values (662) | 213747 |
| Value | Count | Frequency (%) |
| 0 | 2290 | 0.9% |
| 1 | 5497 | |
| 2 | 5289 | |
| 3 | 3813 | |
| 4 | 3823 | |
| 5 | 5947 | |
| 6 | 8907 | |
| 7 | 5743 | |
| 8 | 3908 | |
| 9 | 3076 | 1.2% |
| Value | Count | Frequency (%) |
| 700 | 466 | |
| 698 | 1 | < 0.1% |
| 697 | 2 | < 0.1% |
| 696 | 1 | < 0.1% |
| 695 | 2 | < 0.1% |
| 694 | 3 | < 0.1% |
| 693 | 2 | < 0.1% |
| 692 | 1 | < 0.1% |
| 691 | 2 | < 0.1% |
| 690 | 1 | < 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| PAGADO VENCIDO | |
|---|---|
| CONTADO | |
| PAGADO ANTICIPADO | |
| PAGADO A TIEMPO | |
| DESCUENTO EN VENTA | |
| Other values (3) | 8453 |
Length
| Max length | 18 |
|---|---|
| Median length | 14 |
| Mean length | 12.79885617 |
| Min length | 7 |
Characters and Unicode
| Total characters | 3415042 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PAGADO ANTICIPADO |
|---|---|
| 2nd row | DESCUENTO EN VENTA |
| 3rd row | PAGADO A TIEMPO |
| 4th row | PAGADO ANTICIPADO |
| 5th row | PAGADO VENCIDO |
Common Values
| Value | Count | Frequency (%) |
| PAGADO VENCIDO | 94542 | |
| CONTADO | 78448 | |
| PAGADO ANTICIPADO | 49231 | |
| PAGADO A TIEMPO | 21853 | 8.2% |
| DESCUENTO EN VENTA | 14297 | 5.4% |
| CARTERA CASTIGADA | 4028 | 1.5% |
| OTROS CIERRES | 2508 | 0.9% |
| DEVOLUCION | 1917 | 0.7% |
Length
Pie chart
| Value | Count | Frequency (%) |
| pagado | 165626 | |
| vencido | 94542 | |
| contado | 78448 | |
| anticipado | 49231 | 10.1% |
| a | 21853 | 4.5% |
| tiempo | 21853 | 4.5% |
| en | 14297 | 2.9% |
| venta | 14297 | 2.9% |
| descuento | 14297 | 2.9% |
| cartera | 4028 | 0.8% |
| Other values (4) | 10961 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 564452 | |
| O | 511295 | |
| D | 408089 | |
| N | 267029 | |
| C | 248999 | |
| P | 236710 | |
| I | 223310 | 6.5% |
| 222609 | 6.5% | |
| T | 188690 | 5.5% |
| E | 184544 | 5.4% |
| Other values (7) | 359315 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3192433 | |
| Space Separator | 222609 | 6.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 564452 | |
| O | 511295 | |
| D | 408089 | |
| N | 267029 | |
| C | 248999 | |
| P | 236710 | |
| I | 223310 | 7.0% |
| T | 188690 | 5.9% |
| E | 184544 | 5.8% |
| G | 169654 | 5.3% |
| Other values (6) | 189661 | 5.9% |
Space Separator
| Value | Count | Frequency (%) |
| 222609 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3192433 | |
| Common | 222609 | 6.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 564452 | |
| O | 511295 | |
| D | 408089 | |
| N | 267029 | |
| C | 248999 | |
| P | 236710 | |
| I | 223310 | 7.0% |
| T | 188690 | 5.9% |
| E | 184544 | 5.8% |
| G | 169654 | 5.3% |
| Other values (6) | 189661 | 5.9% |
Common
| Value | Count | Frequency (%) |
| 222609 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3415042 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 564452 | |
| O | 511295 | |
| D | 408089 | |
| N | 267029 | |
| C | 248999 | |
| P | 236710 | |
| I | 223310 | 6.5% |
| 222609 | 6.5% | |
| T | 188690 | 5.5% |
| E | 184544 | 5.4% |
| Other values (7) | 359315 |
tipo_venta
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| ELECTRODOMESTICOS | |
|---|---|
| MOTOS | 886 |
| CONTRATO | 31 |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 16.95910788 |
| Min length | 5 |
Characters and Unicode
| Total characters | 4525097 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ELECTRODOMESTICOS |
|---|---|
| 2nd row | ELECTRODOMESTICOS |
| 3rd row | ELECTRODOMESTICOS |
| 4th row | ELECTRODOMESTICOS |
| 5th row | ELECTRODOMESTICOS |
Common Values
| Value | Count | Frequency (%) |
| ELECTRODOMESTICOS | 265907 | |
| MOTOS | 886 | 0.3% |
| CONTRATO | 31 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| electrodomesticos | 265907 | |
| motos | 886 | 0.3% |
| contrato | 31 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 799555 | |
| E | 797721 | |
| T | 532762 | |
| S | 532700 | |
| C | 531845 | |
| M | 266793 | 5.9% |
| R | 265938 | 5.9% |
| L | 265907 | 5.9% |
| D | 265907 | 5.9% |
| I | 265907 | 5.9% |
| Other values (2) | 62 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4525097 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 799555 | |
| E | 797721 | |
| T | 532762 | |
| S | 532700 | |
| C | 531845 | |
| M | 266793 | 5.9% |
| R | 265938 | 5.9% |
| L | 265907 | 5.9% |
| D | 265907 | 5.9% |
| I | 265907 | 5.9% |
| Other values (2) | 62 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4525097 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 799555 | |
| E | 797721 | |
| T | 532762 | |
| S | 532700 | |
| C | 531845 | |
| M | 266793 | 5.9% |
| R | 265938 | 5.9% |
| L | 265907 | 5.9% |
| D | 265907 | 5.9% |
| I | 265907 | 5.9% |
| Other values (2) | 62 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4525097 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 799555 | |
| E | 797721 | |
| T | 532762 | |
| S | 532700 | |
| C | 531845 | |
| M | 266793 | 5.9% |
| R | 265938 | 5.9% |
| L | 265907 | 5.9% |
| D | 265907 | 5.9% |
| I | 265907 | 5.9% |
| Other values (2) | 62 | < 0.1% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| MENSUAL(ES) | |
|---|---|
| DIARIA(S) | |
| SEMANAL(ES) | 1804 |
| QUINCENAL(ES) | 622 |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 10.60247204 |
| Min length | 9 |
Characters and Unicode
| Total characters | 2828994 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | QUINCENAL(ES) |
|---|---|
| 2nd row | MENSUAL(ES) |
| 3rd row | MENSUAL(ES) |
| 4th row | MENSUAL(ES) |
| 5th row | MENSUAL(ES) |
Common Values
| Value | Count | Frequency (%) |
| MENSUAL(ES) | 210741 | |
| DIARIA(S) | 53657 | 20.1% |
| SEMANAL(ES) | 1804 | 0.7% |
| QUINCENAL(ES) | 622 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| mensual(es | 210741 | |
| diaria(s | 53657 | 20.1% |
| semanal(es | 1804 | 0.7% |
| quincenal(es | 622 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 479369 | |
| E | 426334 | |
| A | 322285 | |
| ( | 266824 | |
| ) | 266824 | |
| N | 213789 | |
| L | 213167 | |
| M | 212545 | |
| U | 211363 | |
| I | 107936 | 3.8% |
| Other values (4) | 108558 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2295346 | |
| Open Punctuation | 266824 | 9.4% |
| Close Punctuation | 266824 | 9.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 479369 | |
| E | 426334 | |
| A | 322285 | |
| N | 213789 | |
| L | 213167 | |
| M | 212545 | |
| U | 211363 | |
| I | 107936 | 4.7% |
| D | 53657 | 2.3% |
| R | 53657 | 2.3% |
| Other values (2) | 1244 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 266824 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 266824 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2295346 | |
| Common | 533648 | 18.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 479369 | |
| E | 426334 | |
| A | 322285 | |
| N | 213789 | |
| L | 213167 | |
| M | 212545 | |
| U | 211363 | |
| I | 107936 | 4.7% |
| D | 53657 | 2.3% |
| R | 53657 | 2.3% |
| Other values (2) | 1244 | 0.1% |
Common
| Value | Count | Frequency (%) |
| ( | 266824 | |
| ) | 266824 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2828994 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 479369 | |
| E | 426334 | |
| A | 322285 | |
| ( | 266824 | |
| ) | 266824 | |
| N | 213789 | |
| L | 213167 | |
| M | 212545 | |
| U | 211363 | |
| I | 107936 | 3.8% |
| Other values (4) | 108558 | 3.8% |
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.928803256 |
| Minimum | 0 |
|---|---|
| Maximum | 14 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 6 |
| 95-th percentile | 14 |
| Maximum | 14 |
| Range | 14 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.194594669 |
|---|---|
| Coefficient of variation (CV) | 1.067652004 |
| Kurtosis | -0.06608681909 |
| Mean | 3.928803256 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.172608222 |
| Sum | 1048299 |
| Variance | 17.59462443 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 150307 | |
| 10 | 22835 | 8.6% |
| 5 | 14883 | 5.6% |
| 14 | 14199 | 5.3% |
| 3 | 10935 | 4.1% |
| 2 | 10602 | 4.0% |
| 6 | 10588 | 4.0% |
| 12 | 8202 | 3.1% |
| 4 | 8035 | 3.0% |
| 9 | 5644 | 2.1% |
| Other values (5) | 10594 | 4.0% |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 1 | 150307 | |
| 2 | 10602 | 4.0% |
| 3 | 10935 | 4.1% |
| 4 | 8035 | 3.0% |
| 5 | 14883 | 5.6% |
| 6 | 10588 | 4.0% |
| 7 | 2008 | 0.8% |
| 8 | 4062 | 1.5% |
| 9 | 5644 | 2.1% |
| Value | Count | Frequency (%) |
| 14 | 14199 | |
| 13 | 625 | 0.2% |
| 12 | 8202 | 3.1% |
| 11 | 3897 | 1.5% |
| 10 | 22835 | |
| 9 | 5644 | 2.1% |
| 8 | 4062 | 1.5% |
| 7 | 2008 | 0.8% |
| 6 | 10588 | |
| 5 | 14883 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| CRÉDITO | |
|---|---|
| CONTADO | |
| LIBRANZA | 5858 |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.021954547 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1873626 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CRÉDITO |
|---|---|
| 2nd row | CRÉDITO |
| 3rd row | CRÉDITO |
| 4th row | CRÉDITO |
| 5th row | CRÉDITO |
Common Values
| Value | Count | Frequency (%) |
| CRÉDITO | 182518 | |
| CONTADO | 78448 | |
| LIBRANZA | 5858 | 2.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| crédito | 182518 | |
| contado | 78448 | |
| libranza | 5858 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 339414 | |
| C | 260966 | |
| D | 260966 | |
| T | 260966 | |
| R | 188376 | |
| I | 188376 | |
| É | 182518 | |
| A | 90164 | 4.8% |
| N | 84306 | 4.5% |
| L | 5858 | 0.3% |
| Other values (2) | 11716 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1873626 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 339414 | |
| C | 260966 | |
| D | 260966 | |
| T | 260966 | |
| R | 188376 | |
| I | 188376 | |
| É | 182518 | |
| A | 90164 | 4.8% |
| N | 84306 | 4.5% |
| L | 5858 | 0.3% |
| Other values (2) | 11716 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1873626 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 339414 | |
| C | 260966 | |
| D | 260966 | |
| T | 260966 | |
| R | 188376 | |
| I | 188376 | |
| É | 182518 | |
| A | 90164 | 4.8% |
| N | 84306 | 4.5% |
| L | 5858 | 0.3% |
| Other values (2) | 11716 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1691108 | |
| Latin 1 Sup | 182518 | 9.7% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 339414 | |
| C | 260966 | |
| D | 260966 | |
| T | 260966 | |
| R | 188376 | |
| I | 188376 | |
| A | 90164 | 5.3% |
| N | 84306 | 5.0% |
| L | 5858 | 0.3% |
| B | 5858 | 0.3% |
Latin 1 Sup
| Value | Count | Frequency (%) |
| É | 182518 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Row | procedencia | genero | estado_civil | edad | municipio_residencia | municipio_nacimiento | municipio_expedicion | tipo_persona | tiene_casa_propia | sueldo_smdlv | otros_ingresos_smdlv | municipio_credito | codeudor | sector | año_credito | mes_credito | valor_credito_smdlv | estado_final | tipo_venta | periodo_credito | cuotas | forma_pago | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | Nacional | NaN | NaN | NaN | SARAVENA | NaN | NaN | Natural | NaN | NaN | NaN | SARAVENA | SIN CODEUDOR | PRIVADO | 2020 | 12 | 356 | PAGADO ANTICIPADO | ELECTRODOMESTICOS | QUINCENAL(ES) | 1 | CRÉDITO |
| 1 | 2 | Nacional | NaN | NaN | 70.0 | ARAUCA | Otros | Otros | Natural | Si | 170.0 | NaN | ARAUCA | CON CODEUDOR | PRIVADO | 2020 | 7 | 49 | DESCUENTO EN VENTA | ELECTRODOMESTICOS | MENSUAL(ES) | 5 | CRÉDITO |
| 2 | 3 | Nacional | NaN | NaN | NaN | ARAUQUITA | ARAUQUITA | ARAUCA | Natural | NaN | NaN | NaN | ARAUQUITA | CON CODEUDOR | PRIVADO | 2017 | 11 | 18 | PAGADO A TIEMPO | ELECTRODOMESTICOS | MENSUAL(ES) | 5 | CRÉDITO |
| 3 | 4 | Nacional | NaN | NaN | NaN | ARAUCA | NaN | NaN | Natural | NaN | NaN | NaN | Otros | CON CODEUDOR | PRIVADO | 2019 | 10 | 65 | PAGADO ANTICIPADO | ELECTRODOMESTICOS | MENSUAL(ES) | 14 | CRÉDITO |
| 4 | 5 | Nacional | NaN | NaN | NaN | ARAUQUITA | ARAUQUITA | ARAUQUITA | Natural | NaN | NaN | NaN | ARAUQUITA | CON CODEUDOR | PRIVADO | 2018 | 5 | 61 | PAGADO VENCIDO | ELECTRODOMESTICOS | MENSUAL(ES) | 12 | CRÉDITO |
| 5 | 6 | Nacional | NaN | NaN | NaN | ARAUQUITA | ARAUQUITA | ARAUQUITA | Natural | NaN | NaN | NaN | ARAUQUITA | CON CODEUDOR | PRIVADO | 2019 | 6 | 114 | PAGADO ANTICIPADO | ELECTRODOMESTICOS | MENSUAL(ES) | 12 | CRÉDITO |
| 6 | 7 | Nacional | NaN | NaN | NaN | ARAUCA | NaN | NaN | Natural | NaN | NaN | NaN | SARAVENA | CON CODEUDOR | PRIVADO | 2019 | 11 | 349 | PAGADO VENCIDO | MOTOS | MENSUAL(ES) | 12 | CRÉDITO |
| 7 | 8 | Nacional | NaN | NaN | NaN | SARAVENA | ARAUCA | ARAUCA | Natural | NaN | NaN | NaN | SARAVENA | CON CODEUDOR | PRIVADO | 2007 | 11 | 51 | PAGADO ANTICIPADO | ELECTRODOMESTICOS | MENSUAL(ES) | 6 | CRÉDITO |
| 8 | 9 | Nacional | NaN | NaN | NaN | ARAUCA | NaN | NaN | Natural | NaN | NaN | NaN | SARAVENA | SIN CODEUDOR | PRIVADO | 2019 | 12 | 284 | PAGADO VENCIDO | MOTOS | MENSUAL(ES) | 2 | CRÉDITO |
| 9 | 10 | Nacional | NaN | NaN | NaN | ARAUCA | NaN | NaN | Natural | NaN | NaN | NaN | SARAVENA | SIN CODEUDOR | PRIVADO | 2019 | 11 | 124 | PAGADO VENCIDO | ELECTRODOMESTICOS | MENSUAL(ES) | 12 | CRÉDITO |
Last rows
| Row | procedencia | genero | estado_civil | edad | municipio_residencia | municipio_nacimiento | municipio_expedicion | tipo_persona | tiene_casa_propia | sueldo_smdlv | otros_ingresos_smdlv | municipio_credito | codeudor | sector | año_credito | mes_credito | valor_credito_smdlv | estado_final | tipo_venta | periodo_credito | cuotas | forma_pago | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 266814 | 266815 | Nacional | NaN | NaN | NaN | SARAVENA | ARAUCA | NaN | Natural | NaN | NaN | NaN | SARAVENA | SIN CODEUDOR | PRIVADO | 2017 | 1 | 81 | DESCUENTO EN VENTA | ELECTRODOMESTICOS | SEMANAL(ES) | 14 | CRÉDITO |
| 266815 | 266816 | Nacional | NaN | NaN | NaN | SARAVENA | ARAUCA | NaN | Natural | NaN | NaN | NaN | SARAVENA | SIN CODEUDOR | PRIVADO | 2015 | 1 | 42 | PAGADO ANTICIPADO | ELECTRODOMESTICOS | SEMANAL(ES) | 8 | CRÉDITO |
| 266816 | 266817 | Nacional | NaN | NaN | NaN | ARAUCA | ARAUCA | NaN | Natural | NaN | NaN | NaN | SARAVENA | SIN CODEUDOR | PRIVADO | 2017 | 8 | 14 | PAGADO A TIEMPO | ELECTRODOMESTICOS | SEMANAL(ES) | 12 | CRÉDITO |
| 266817 | 266818 | Nacional | NaN | NaN | NaN | SARAVENA | ARAUCA | NaN | Natural | NaN | NaN | NaN | SARAVENA | SIN CODEUDOR | PRIVADO | 2016 | 4 | 71 | DESCUENTO EN VENTA | ELECTRODOMESTICOS | SEMANAL(ES) | 12 | CRÉDITO |
| 266818 | 266819 | Nacional | NaN | NaN | NaN | SARAVENA | ARAUCA | NaN | Natural | NaN | NaN | NaN | SARAVENA | SIN CODEUDOR | PRIVADO | 2017 | 11 | 16 | DESCUENTO EN VENTA | ELECTRODOMESTICOS | SEMANAL(ES) | 14 | CRÉDITO |
| 266819 | 266820 | Nacional | NaN | NaN | NaN | SARAVENA | ARAUCA | NaN | Natural | NaN | NaN | NaN | SARAVENA | SIN CODEUDOR | PRIVADO | 2011 | 8 | 50 | PAGADO VENCIDO | ELECTRODOMESTICOS | SEMANAL(ES) | 14 | CRÉDITO |
| 266820 | 266821 | Nacional | NaN | NaN | NaN | SARAVENA | ARAUCA | NaN | Natural | NaN | NaN | NaN | SARAVENA | SIN CODEUDOR | PRIVADO | 2016 | 8 | 44 | PAGADO VENCIDO | ELECTRODOMESTICOS | SEMANAL(ES) | 14 | CRÉDITO |
| 266821 | 266822 | Nacional | NaN | NaN | NaN | SARAVENA | ARAUCA | NaN | Natural | NaN | NaN | NaN | SARAVENA | SIN CODEUDOR | PRIVADO | 2017 | 10 | 52 | DESCUENTO EN VENTA | ELECTRODOMESTICOS | SEMANAL(ES) | 14 | CRÉDITO |
| 266822 | 266823 | Nacional | NaN | NaN | NaN | ARAUCA | ARAUCA | NaN | Natural | NaN | NaN | NaN | ARAUCA | SIN CODEUDOR | PRIVADO | 2007 | 3 | 7 | CONTADO | ELECTRODOMESTICOS | SEMANAL(ES) | 1 | CONTADO |
| 266823 | 266824 | Nacional | NaN | NaN | NaN | TAME | ARAUCA | NaN | Natural | NaN | NaN | NaN | TAME | SIN CODEUDOR | PRIVADO | 2017 | 9 | 117 | PAGADO VENCIDO | ELECTRODOMESTICOS | SEMANAL(ES) | 14 | CRÉDITO |